1 CCTCGGCGGC CCGCATCTGC CCCCGCGCGC CCGCCCTGAG CCCGCCCCGA 
51 CTGGGCAGGC GGGGGAGCCC CTACTTCTCT CCCCCCGGGC GGGGGAGCCG 
101 GGGGGCAGCG CCGGAGCCCG GGGGGAGCTC AGCCCCGCCG ACCGGCCGGC 
151 CAGGGCAGGG GGCAGCTAGG ACGGCCCCGG TCCAGGTGGA GGCCGCAGAG 
201 GGCCCAGGGC AAGCAGAGGC AGCAATGGTT GGTCCTGACG GTGGCTGAGC 
251 CCCCAGCCCC TGGAATATGC AGCCCGGGGG AGCCCCAGAC AGCGGCAAGG 
301 ACGAGGTGGC GGAGTGGGGC GGGAGGCATG GTCTCCACCT ACCGGGTGGC 
351 CGTGCTGGGG GCGCGAGGTG TGGGCAAGAG TGCCATCGTG CGCCAGTTCT 
401 TGTACAACGA GTTCAGCGAG GTCTGCGTCC CCACCACCGC CCGCCGCCTT 
451 TACCTGCCTG CTGTCGTCAT GAACGGCCAC GTGCACGACC TCCAGATCCT 
501 CGACTTTCCA CCCATCAGCG CCTTCCCTGT CAATACGCTC CAGGAGTGGG 
551 CAGACACCTG CTGCAGGGGA CTCCGGAGTG TCCACGCCTA CATCCTGGTC 
601 TACGACATCT GCTGCTTTGA CAGCTTTGAG TACGTCAAGA CCATCCGCCA 
651 GCAGATCCTG GAGACGAGGG TGATCGGAAC CTCAGAGACG CCCATCATCA 
701 TCGTGGGCAA CAAGCGGGAC CTGCAGCGCG GACGCGTGAT CCCGCGCTGG 
751 AACGTGTCGC ACCTGGTACG CAAGACCTGG AAGTGCGGCT ACGTGGAATG
801 CTCGGCCAAG TACAACTGGC ACATCCTGCT GCTCTTCAGC GAGCTGCTCA 
851 AGAGCGTCGG CTGCGCCCGT TGCAAGCACG TGCACGCTGC CCTGCGCTTC 
901 CAGGGCGCGC TGCGCCGCAA CCGCTGCGCC ATCATGTGAC GCCTGCGCGC 
951 CCCTCGGGCT GCACCGGCAC TGGCCGAGCG GAGGGCGGGG CCGTACTGCG 
1001 GGGCTGGGGC GGGGAGCGGG CGGGAAATGG AACTGTGACG GTCCCGGCCT 
1051 GAGGCCCCTG CAGCCACGCA CCTCCCGGTG AGAAGCAGAG CGCGAGAGGG 
1101 AGCCCTCCGT AACTGCCCAG CCCTGCCCCT TGCCCCCGTG GCTTCCTGGG 
1151 ACAGCCGCCT TCAGTGCTGT ATTTAGTGCA GTGCCCGGCC CGACCCGCGG 
1201 GGGTGCCACA GCCTTTTGGG ATGGGGGTGA GCGTGCAATG GAGGCTGGGG 
1251 GTGGCGAGGT GCCGCCTTGG CCGGGCCCCC ACGTGTCTTC TCCAGAATGT 
1301 GTCTGTCTTT GCCTGGTGTC TTCCTTTCCC GTGTCCGCCC ACCCCAGCGT 
1351 CTGTTGGTAC TTACCTGTCT CACCTACCCT CCAGTCCCCT CCCAGCTCCG 
1401 CTCACAGGGC TCTCATTTCG TCCATCCCCT TGTCGCAGAT CCTGGCAGCT 
1451 TCTTTGTGAG GCCAGGCCTT CTGACTGTCA GCACCACCGG CACAGGGCAG
1501 AGATGCGGGT GGCCCAAGGA CCACGATCAA GGGGTCCGGG GGACCGAGGT 
1551 CCCAGATCAG TGAGGGGAGA AGGTTGAGCT CTCCGGCTTC CAGGGAGACC 
1601 TCCCCGCCCA GCAGCCCCCA GAGACACAAC AACCTACCTT CCAGCCTTAA 
1651 CTCGATGGTC CGTCCCTGCC AGGTGCCCCT CACTCTTCCT GACCCCAAAG 
1701 CCAGATCACC CCCTGGGTTA AAACTTTTTT TCTTTTTTTT TTTTGGACAG 
1751 AGTGTGGAAA GGGAGCCCCC CAAAGGATAG CTTCTTTTTC ATGATGCCAG 
1801 GCTCCAGTCC TTTATTCCCT TCTGCATACT GCAATCTGAT CTGTCAGACT 
1851 GGGGAATGTT GGGTTCTGGG GTCTGGTCGT GGGCAGGATG GTGCCCAGAA 
1901 GGGGGTTAGG TTGTCCCAGT GAAAATTCTG TTGCCCCGTC TCAACCCCAT 
1951 CTGACTACCC CAGACTCTGC CTGCCTCAGA TCTCAGACTA TCCTGATTAA 
2001 TCTGGGGAAG AACAGAGCCA GGGAAAGAAT GGTGGGGACC CCTGTACTTG 
2051 GGGGAGACAC ACCTGCATCT TCCTCCTGCC ACAGATGGAG GCCCTCAGGA 
2101 TCTGACACCC TCTTGTCCCA ACACCAGTCA GCCCTATACC CTAACTCACT 
2151 CCACCCCATT TTCTCCGGCT GCCTGGCCGG GTTTCTACCT CTCGTCACCG 
2201 GAGCTGATCA CTGTCAGTTT TGTACCGATT TAGAAATAAC AATAATAATG 
2251 AAGATTCTAG GAATGGCATG AGGGATTGAT GGGGGACTTG GAGGGAGGGA 
2301 CAAGTGGTGC CCTGTCCCCT GCTCCCCTGG CCAAAGAAAG CTGTCCTTGA 
2351 GGCTGAGCCC TCAGCCCTGG CCTGGTGGGG GGACAGCAAG GTCCCTTGTT 
2401 ATAAGAGGGG CAGAGAGGAC AACTCCGCTT TGGCCAACCT AGCCAAGGCT 
2451 GCAGCATATA GACCAGGAAA TCAGGTAGCC CAGACTGGTG ATGGAGCAGA
2501 GTCTGGGGGA AGGGTCGTGG GTGGGGAATT TATCACCAAC ATCCATTGTA 
2551 GGGGGAATCT ATGATTCTGC TTCCCCAGCG GATTCCCACT CTGTCCACCA
2601 AGTGGGGGGT AGCACAGCCT CACAGCAACC GCCCTGACCT TGGGCAGTCT 
2651 AGTGTTCCTG CATTCTAGTC CCTGCTGTGC TGCAGGACTT TGGGCAAGTG
2701 ACCTGCCCTC TGTGAGCCTC CCTCTGACAC AGAGGAGGTG GCTCCCCTTC 
2751 CCCACACCTT AGAGTGGCTG GGAGGGTAAC AAAGAGGGCC TGCCCCTTTA 
2801 GTCTCCTGCA CCCCTGCCCC CTGGTTCACC AGAGGGAGCG GATGAAGGAT 
2851 GGCAGCATCT CACATGCCCC ATCACCAACT CTGAGGCACC TGGGGTGGGG 
2901 GGGCGGAGCC CAGGCCTCTG GCTGCTCCCC TGTGGGAGCC ATTGGAATGT 
2951 ATCCCCTGAC AGGCCCCCTT CCGCCTCCAC CTCAACCCAG GTCTTGGATT 
3001 TCAGGTCCCT CCACCCCCAT TCTGAGTCTC TGTCCTTCTC CTTCCACCCG 
3051 CTCCCAGGGT TTCCCACCAC AGGGTCTGGA AGTGTGTGTG ACGCCCATTG
3101 AGCTGTTACC CGAAGTCAGA TTAAAAATCA GGGAGTGTTT TCCCTCGTTT 
3151 CTGTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA (SEQ ID NO:1)



FEATURES: 

5'UTR: 1-327 

Start Codon: 328 

Stop Codon: 937 

3'UTR: 940 
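
The coordinates above imply an open reading frame of 937 - 328 = 609 nucleotides, i.e. 203 codons. A minimal Python sketch of how such 1-based coordinates could be used to translate an ORF is given here; the function name `translate_orf` and the toy input are illustrative assumptions and are not part of the listing.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_orf(seq, start, stop):
    """Translate from 1-based `start` up to, but not including,
    the stop codon whose first base is at 1-based `stop`."""
    cds = seq[start - 1:stop - 1].upper()
    if len(cds) % 3:
        raise ValueError("ORF length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Hypothetical toy input: two UTR bases, the first eight codons of the
# ORF shown in the listing, then a TGA stop codon.
toy = "GC" + "ATGGTCTCCACCTACCGGGTGGCC" + "TGA"
print(translate_orf(toy, 3, 27))  # prints MVSTYRVA
```

Applied to the full transcript with start 328 and stop 937, the same call would be expected to yield a 203-residue product.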



Homologous proteins:

Top 10 BLAST Hits
                                                                       Score   E
CRA|18000005055410 /altid=gi|5454030  /def=ref|NP_006468.1| RAS-...      199   1e-49
CRA|18000004990267 /altid=gi|7438395  /def=pir||T15833 hypotheti...       89   3e-16
CRA|18000005229906 /altid=gi|4884048  /def=emb|CAB43324.1| (AL05...       86   2e-15
CRA|18000005189812 /altid=gi|9297040  /def=sp|O94363|RHEB_SCHPO...        85   3e-15
CRA|89000000193051 /altid=gi|7290026  /def=gb|AAF45493.1| (AE003...       84   1e-14
CRA|89000000198900 /altid=gi|7296750  /def=gb|AAF52029.1| (AE003...       83   2e-14
CRA|87000001027207 /altid=gi|7323471  /def=gb|AAF59545.1| (AC024...       82   4e-14
CRA|40000057438157 /altid=gi|10119859 /def=dbj|BAB13483.1| (AB0...        81   7e-14
CRA|18000004932656 /altid=gi|6981476  /def=ref|NP_037348.1| Ras...        81   9e-14
CRA|18000004905102 /altid=gi|5032041  /def=ref|NP_005605.1| Ras...        81   9e-14

BLAST dbEST hits:
                                                                       Score   E
gi|10220573 /dataset=dbest /taxon=96...                                  444   e-122

EXPRESSION INFORMATION FOR MODULATORY USE:

library source:

Expression information from BLAST dbEST hits:
gi|10220573 Lung small cell carcinoma


Expression information from PCR-based tissue screening panels: 
Whole brain 
1 MVSTYRVAVL GARGVGKSAI VRQFLYNEFS EVCVPTTARR LYLPAVVMNG
51 HVHDLQILDF PPISAFPVNT LQEWADTCCR GLRSVHAYIL VYDICCFDSF 
101 EYVKTIRQQI LETRVIGTSE TPIIIVGNKR DLQRGRVIPR WNVSHLVRKT 
151 WKCGYVECSA KYNWHILLLF SELLKSVGCA RCKHVHAALR FQGALRRNRC 
201 AIM (SEQ ID NO:2) 

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site

142-145 NVSH

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site

Number of matches: 5

1 4-6 TYR

2 37-39 TAR

3 105-107 TIR

4 150-152 TWK

5 159-161 SAK

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

70-73 TLQE

[4] PDOC00008 PS00008 MYRISTYL
N-myristoylation site

Number of matches: 2

1 14-19 GVGKSA

2 178-183 GCARCK

[5] PDOC00266 PS00294 PRENYLATION
Prenyl group binding site (CAAX box)

200-203 CAIM

[6] PDOC00017 PS00017 ATP_GTP_A
ATP/GTP-binding site motif A (P-loop)

11-18 GARGVGKS
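
Short sequence motifs of this kind can be located with ordinary regular expressions. The sketch below is illustrative only: the four regular expressions are assumed transcriptions of the corresponding PROSITE patterns, the helper `scan` is hypothetical, and the protein string is SEQ ID NO:2 with residues 41-50 read as LYLPAVVMNG, as the cDNA translation indicates.

```python
import re

# Assumed regular-expression transcriptions of four PROSITE patterns.
MOTIFS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",   # N-{P}-[ST]-{P}
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",        # [ST]-x-[RK]
    "CK2_PHOSPHO_SITE": r"[ST].{2}[DE]",     # [ST]-x(2)-[DE]
    "ATP_GTP_A": r"[AG].{4}GK[ST]",          # [AG]-x(4)-G-K-[ST]
}

PROTEIN = (
    "MVSTYRVAVLGARGVGKSAIVRQFLYNEFSEVCVPTTARRLYLPAVVMNG"
    "HVHDLQILDFPPISAFPVNTLQEWADTCCRGLRSVHAYILVYDICCFDSF"
    "EYVKTIRQQILETRVIGTSETPIIIVGNKRDLQRGRVIPRWNVSHLVRKT"
    "WKCGYVECSAKYNWHILLLFSELLKSVGCARCKHVHAALRFQGALRRNRC"
    "AIM"
)


def scan(protein, pattern):
    """Return (start, end, match) tuples with 1-based inclusive
    coordinates; the lookahead lets overlapping sites be reported."""
    hits = []
    for m in re.finditer(f"(?=({pattern}))", protein):
        s = m.start(1)
        hits.append((s + 1, s + len(m.group(1)), m.group(1)))
    return hits


for name, pattern in MOTIFS.items():
    print(name, scan(PROTEIN, pattern))
```

Under these assumptions the scan reports one N-glycosylation site (142-145), five protein kinase C sites, one casein kinase II site (70-73) and one P-loop (11-18), in agreement with the positions listed for these four motifs.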



FIGURE 




BLAST Alignment to Top Hit: 

>CRA|18000005055410 /altid=gi|5454030 /def=ref|NP_006468.1|
RAS-related on chromosome 22 [Homo sapiens] /org=Homo
sapiens /taxon=9606 /dataset=nraa /length=203
Length = 203

Score = 199 bits (501), Expect = 1e-49
Identities = 105/204 (51%), Positives = 134/204 (65%), Gaps = 1/204 (0%)
Frame = +1

Query: 328 MVSTYRVAVLGARGVGKSAIVRQFLYNEFSEVCVPTTARRLYLPAVVMNGHVHDLQILDF 507
           M  + RVAVLGA GVGK+AI+RQFL+ ++ E   PT   RLY PAV+++G V+DL I D
Sbjct: 1   MGGSLRVAVLGAPGVGKTAIIRQFLFGDYPERHRPTDGPRLYRPAVLLDGAVYDLSIRDG 60

Query: 508 PPIS-AFPVNTLQEWADTCCRGLRSVHAYILVYDICCFDSFEYVKTIRQQILETRVIGTS 684
                       +EW D     L+   A++LVYDIC  DSF+YVK +RQ+I ETR  G
Sbjct: 61  DVAGPGSSPGGPEEWPDAKDWSLQDTDAFVLVYDICSPDSFDYVKALRQRIAETRPAGAP 120

Query: 685 ETPIIIVGNKRDLQRGRVIPRWNVSHLVRKTWKCGYVECSAKYNWHILLLFSELLKSVGC 864
           E PI++VGNKRD QR R  PR  ++ LVR+ W+CGY+ECSAKYNWH+L LF ELL+
Sbjct: 121 EAPILVVGNKRDRQRLRFGPRRALAALVRRGWRCGYLECSAKYNWHVLRLFRELLR-CAL 179

Query: 865 ARCKHVHAALRFQGALRRNRCAIM 936
            R +  H ALR QGAL   RC++M
Sbjct: 180 VRARPAHPALRLQGALHPARCSLM 203 (SEQ ID NO:4)
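
The Identities and Positives figures can be cross-checked from the middle (match) line of each alignment block, where a letter marks an identical residue pair and "+" marks a conservative substitution. The following Python sketch is offered only as an illustration; the match lines are whitespace-restored transcriptions of the alignment (an assumption), and `tally` is a hypothetical helper.

```python
# Match lines of the four alignment blocks, transcribed with spacing
# restored (assumed); letters are identities, '+' are conservative pairs.
MATCH_LINES = [
    "M  + RVAVLGA GVGK+AI+RQFL+ ++ E   PT   RLY PAV+++G V+DL I D",
    "            +EW D     L+   A++LVYDIC  DSF+YVK +RQ+I ETR  G",
    "E PI++VGNKRD QR R  PR  ++ LVR+ W+CGY+ECSAKYNWH+L LF ELL+",
    " R +  H ALR QGAL   RC++M",
]
ALIGNED_COLUMNS = 204  # alignment length stated in the BLAST header


def tally(match_lines):
    """Return (identities, positives) counted from BLAST match lines."""
    text = "".join(match_lines)
    identities = sum(ch.isalpha() for ch in text)
    positives = identities + text.count("+")
    return identities, positives


identities, positives = tally(MATCH_LINES)
# Integer division reproduces the truncated percentages in the header.
print(identities, identities * 100 // ALIGNED_COLUMNS)
print(positives, positives * 100 // ALIGNED_COLUMNS)
```

With these transcriptions the tally gives 105 identities (51%) and 134 positives (65%) over 204 aligned columns, matching the reported header values.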



Hmmer search results (Pfam):

Model      Description                  Score   E-value   N
PF00071    Ras family                    44.3   4.9e-12   2
CE00060    CE00060 rab_ras_like          11.1     0.054   1
PF01118    Semialdehyde dehydrogenase     4.4       1.4   1

Parsed for domains:

Model      Domain   seq-f   seq-t        hmm-f   hmm-t        score   E-value
PF01118    1/1          5      14   ..       1      10   [.     4.4       1.4
PF00071    1/2          6      36   ..       1      31   [.    22.2   1.7e-05
CE00060    1/1         87     138   ..      99     150   ..    11.1     0.054
PF00071    2/2         89     161   ..      76     157   ..    22.1   1.8e-05
1 GGCGAACGTG GTTGGAACAA ACAGCTGGCA GACTTGTGAC CCGGCCCTCG
51 GGATCCGCGA AGCCCCCGCT CTCAGCCTTG GGCAGCGACC CAGGTGTCCA 
101 GACGCAGGGA AGGGGACGGA ACCAGGCTTC GCCCCCTGTG TGTGTCTCTG 
151 GTCTCTTGCC TCTCTTTCCG GGCAGTCTCT TGCCGGGCGC TGTCTGCAGA 
201 CTCACTGCAG AGCGCAGGCC TTGGGGAGAA AGCGCTCAGG GGCCTGGGCC 
251 CTGCTTCCTG GGACAGCCCC CTCCCTTCCG CACTCAGCCA AGGGTGTTGG 
301 ATTAATATAC TCTTAGATCC AGGACCTCTG CCTGGAAAGA AGGAAGGGGC 
351 ACCGATCGCT TCTGAGTTGG GGACAGGGCC ACACTTGGAC CTGATGAGTT 
401 CCAGGAGGCT GGCCCCAGCC TAGGGGCCTC CTGCACCCCC TTCCTAGCTC 
451 CTGGGTAGTG CCCCTCTTGC ATTTCCCTGC CGCTCCCCAG AAAGGGCTGG 
501 TTTCTGGGCA GGGAGTGTGG TGTCCGCGAC AATCTGCAGG TAGGGTGCAT 
551 CCTAGAAATC TCTGAGCTGC CACACCCAGA GAGAGGCCAG GACTCTTCTT 
601 GGCGTTCCTT GCCCTCTCTT CCTCCCCTAG GACTCCCCAC TCCCCACCTG 
651 CCTGTACTGG TTATGTAATT AACCCAGCTG AGTCTCTCTG CCATGCTGGT 
701 GGTAGTGGTG TCCAGGGATG CCTAAGGGGC TTCGGGACCT GGAGGGACAC 
751 ATGGGGAGGA GGGATAGGTA CTTCCCCCTA GTTGGGAGCC CATGTAAGTG 
801 TATAAACACC TGTTGGAGAA GGGACACTGC ATGGGGCAGG AAGGAGTTCA 
851 GGGTCCTAAT CTTAGTTAAC AACTCAACTG GCTATGAGGT CTTCACTTCC 
901 CTAAGTCACT GGGTTTGTTT TTTGGGGGGT TTGTTTGTTT TTTAAACCTT
951 CAATAAAATG AGAAGAATCT CTCTCACGCA CACTGCATGC ATGTGAGCTC 
1001 ATGCAACAGT AGATAGCAGC CCCTCCTGGG ATTGCTAGGA GGTTCCACTG 
1051 CTAAGTTTGT GGGAAGCTTA GATGAATGAA TGACCCCTGC CCATAGCAGT 
1101 GGTCACTGTT TATTGAACAC CTACTCTGGC CTAGGCACTG GGCTTTACAT 
1151 GCACACATCA TAACACTTAA TAAAGTTTCT TGCCCAAGGT CACACTTCAG 
1201 GTGAATGGCA GAGCTAGGAA TCAAGCCCTG GCGGTTCTCA TTCCACATCC 
1251 TGACCTGTAT TTACTATGCT GTGTGCGCAC ATACCCCGTG GAGTGAGTCC 
1301 TCTGAACCAG GACAACCTGG GGGACATTCA GCTGTGGCTT GTGCATATGT 
1351 GGAAAGAACA TGCATCTGCA GAGAAGTATG CACCCATGCG GAAAAGCATG 
1401 GCCTTGCCAG AATCCGGCGA ACCCACCTGC CCTGGAGCCA GTTAGCAAGT 
1451 GCTGATGTCA CCCTCCTGGG AATCCAGACA GGAGGTCTCA GGCATGTAAC 
1501 AGCCTCTCTG GTCACCATCA CCAACAAAAG AAGAAAGGTT CTCTCCTTTT 
1551 CCTGGCTAAG AGATTTACTT GGATTTTCTC AGAAGTAGGG GCTGTTGACC 
1601 TCATTTTACC TATGTGCAGA AGCATGGCTC CAGTTGAGAA GGTGATTTGC 
1651 CCACTGACCT GCAGCAAGGA CACACCAGAG CTGATGAATT CGTATCAAGG
1701 CCCAAACCCC TAGCAGCCCT TTTCTGGGCA CCTGCTATGT GCCAGGCCCT
1751 CTGGGCTCTA GGGGATGCCA GAGAATCAGA CCCAGACCTT GGCCTATATG 
1801 CATTCACTGG TTAGAATCAC TGAACTTGGC CAGGCGCGGT GACTCACGCC 
1851 TGTAATCCCA GCACTTTGGG AGGCCAAGGT GGGTAGATCA CCTGAGGTCA 
1901 GGGATTCCAG GCCAACATGG CGAAACCCCA TCTCTACTAA AAATACCAAA 
1951 ATTATCTGGG TGTGGTGGCA GGTGCCTGTA ATCCCAGCTA CTCGGGAGGC 
2001 TGAGGCAGGA AAATGGCTTG AACCCAGGAG GTGAAGGCTG CAGTGAGTCG 
2051 AGATCATGCC ACTGTACTCC AGCCAACTTG AAAGTTGGAA GAAGTACTGC 
2101 GACATCGTAG AAAGGAGTAG CTGCCCTTCG GGTTGTGATT GGGCACACTT 
2151 CTTACTCCTT CTAAGCTTCT CTCTAAAGTG GGAATAACAG TACTTATCCC 
2201 TTAGAGATGT CGTACTGACC AAGGATAACC ATATGGGCAT GCCAGCAGAG 
2251 TAAGTGCTCA GCAGTGTTGC ACCCCAGAGG CGACGTGGTC ATCTAGTGCA 
2301 GCATTCTCAA CTAGGGGCAT TTGGAAATGG GGGCAGGGGG AGTTTTTGGT 
2351 CATCATGGTG TCTGGAGGCT GGCAGTTACC TCTTGGGGGC CACGGAAAGC 
2401 AAATGTCCTG CAATGTGTGG GCAGTCCTGC ACAATCGAGT TCTCTCCCCA 
2451 CAAAATGTCA GGAGTGCCAG CACTGAGACA CACCTGCCCA GTCCCACCCA 
2501 TTCAGGAGGA CACAGACTCA GAGGTGTTGC CGTCTTGTCC CAGGCTCTGT 
2551 GGGGAAGCTG GGATCAAACC AAGTCCAGTG CGCTTCCCAC TCTGCTCTGC 
2601 AGCCTGTTTT GGTTGGAGTT GGACCTGGAG AAAAGTCAAG TCATAAGTCA
2651 AGAAAGATTG GGCCCTACTA CTGGAATGCA GGAAAAAATG GAGGAGGGAT 
2701 GGAGAGGTTT TGGAAAGGCA GCCACAGGGG TTCTGGGAGA GGGAAGGCAT
2751 TCTAAGTGGC AGTAACAGCT TCAGCAAAGT CCCAAAGGTG GAAAAGTGCA 
2801 GGACACGTCC AGGGATAAGC CAGTGCACTA AGCCCACCTC TTGTCCCCAC 
2851 AGTCCAGGTG GAGGCCGCAG AGGGCCCAGG GCAAGCAGAG GCAGCAATGG 
2901 TTGGTCCTGA CGGTGGCTGA GCCCCCAGCC CCTGGAATAT GCAGCCCGGG 
2951 GGAGCCCCAG ACAGCGGCAA GGACGAGGTG GCGGAGTGGG GCGGGAGGCA 
3001 TGGTCTCCAC CTACCGGGTG GCCGTGCTGG GGGCGCGAGG TGTGGGCAAG 
3051 AGTGCCATCG TGCGCCAGTT CTTGTACAAC GAGTTCAGCG AGGTCTGCGT 
3101 CCCCACCACC GCCCGCCGCC TTTACCTGCC TGCTGTCGTC ATGAACGGCC 
3151 ACGTGCACGA CCTCCAGATC CTCGACTTTC CACCCATCAG CGCCTTCCCT
3201 GTCAATACGC TCCAGGTAGG AGGACCCTGG GGGGCATGGG TTAGTGGGGA
3251 AACGGATGGG TAGGGGAGAG GCTGGATTCC AAACTGCTGT AGCTTGGGCC
3301 CTATTGCCAG GGCCCCATCA CTGAGTTTGG GAGCTCCACA CTGCACCTTG
3351 GGCCACTCTG CTTAGAGCCG TTCCAGGAAT CCATTCATTG GTGTGCTAGT
3401 TTATTCAACA AATATTTGGT GACCGTTCAA TGTGTGCCAG GCCCTGCAGT
3451 GGGCACTGGT GCAGAATGGT GAGCAAAAAA TATATGGAAT TTGCTTTCAA
3501 GAAACTCATA GTCTGGTGAG AAAAGGCAAA TATGGTGTGA TAAGTTCTAT
3551 GATTGGAGGA GCAGGGAGCT GGGGCAGCCC TTAAGGGGGC ATCTAGGCCA
3601 TCCAGATGTG TTGGGGTGGA GTTGGGGGGT CACAGAGGGT GATGTCTCAA
3651 CTAAATAGGT TTTAGGCAGG TAAGAGTCAG TAGAGAAAAG GACAGGGAAC
3701 ACTAGGCTAC TGTGAGTATT CGGAGCTGTG CCTACCGTAA CCTCACTCCA
3751 CATCCTCTGG AGAAGGGACA GCAGCAGAAC AGACGGGGCC CTGGGAAAGG
3801 TGTGTTCTTG GAGACTCTGG AGACCCCAGT CAGGTCTCTT GCCCAAGGCC
3851 CTCTTCTCTT AAGTGATGCT CTGCCCCTGA CCTCAGGACC TGCCTGCTGG
3901 GCACCCTCCC TGCCAGGTTT GGATTTAAAT GCCTGAGGGT CCTCACTTAT
3951 TGTGTTCCTT CCCCACTGCC TGCTGGAACC AGGTCCTCTT GCCCTCTCTC
4001 AACCTCTGAC TTGAGAGGGA GTGGAGAGAA AAAGGAAGCT GAGCTCTAGG
4051 ACATGTTTGC TCACTGAAGG AAGCCTCTGA CCAGAGTGTA CAGAGCTTTT
4101 CCAGGAAGGA CAGGCACAGT GGTGGAGGCC CAGAAGACAG GGGACAAGGC
4151 TCGTCCAGGT GTAACTGAGC AAATCAAGCA GTCTCTCAGG CTGAGACCCT
4201 GGGCTGGGAG ATGGCGGGCA GCTCAGCACT CAGCACTCTC GGCAACACCA
4251 GGCAGGAGGG CCCTGGCCTA ATCTGCCGGA GACACCTGTT CACCCATCCC
4301 AGGCACCTGG GGTCAGGAGG AAAGATGGAA GCCTGATCCC GCATCTGCCC
4351 TGGAAGCAGT GAGGCTGAGC CTGTCAGGGC AGACAGTCTG GATGCAGGGC
4401 CTTCTAGTTC TCTTCTAAAG GAGACTTTAA CAATCACCTG ATTGGACATT
4451 CAAATCTTGC TCCAAGCCTA CACACTGAGC TTTGTTGATT TCATCTTGCC
4501 CCCTTTACCT TGATTCCTGC CCCACTCTCT ATAACCACTC TTATCGAATT
4551 TTTCTTTCTT TTTTAAAATT TATTTATTTT TTTATTTTAG ATGGAGTCTC
4601 CCTCTGTCGC CCAGGCTGGA GTGCAGTGGC ACGATCTCGG CTCACTGCAA
4651 TCTTCGCCTC CCGGGTTCAA GCGATTCTCC TGCCTCAGCC TCCTGAGTAG
4701 CTGGATTACA GGCACCTGCG ACCACACCCA GCTAATTTTT GTATTTTTAG
4751 TAGAGATGGG GTTTCACCAT GTTGGCCAGG CTGGTCTCAA ACTCCTGACC
4801 TCAAGTGATC CGCCTGCCTA GGCCTCCCAA AGTACTGGGA TTATAGGCAT
4851 GAGCCACCAC GCCTGGTCTC TTATCCATAC TTTCAGTGTT TCTTTACCCA
4901 AGTAAGAAAA TGCATTCTTC CCTGCTTCTT ACGTAAAGAA CAAAACAAAA
4951 ACAAGAACCA TACTGTTCTG TACCTTGATT TTATTTTATT TTTAAAATTT
5001 TTTGTATAGA TGGGTCTTGC TGTGTTACCT AAGCTGATCT CGAACTACTG
5051 GCCTCAAGCG ATCCTCCTGC TTTGGCCTCT CAAAGTGCTG GGATTACAAG
5101 TGTGAGCCAC TGTGCTTGGC GCTGTACCTC AATTTTTTTA ACTTGCTATT
5151 ATAACCTGAA GATTTTTCCA GGCCATTATC TAGAGGACGT CCTCATTCTT
5201 TTTTCATGGC CACGCCCTAC TCCATTGAAG AGCTATACCA TGGAGTCCTT
5251 TCTTGTTGGA TAAGTGGGTG GTATCCAGTC TTGTGCTGTT TCAAACAGTG
5301 CCACAATGAG TGGCCTTGTA GATAGGTCAT TTTGAACATA AGTAGGTATA
5351 TCTGTGGGAT CAATTACCGG AAAGGGCATT GCTGGAAATG GCACTGCTGG
5401 ATCACAATGC CTGGAAATGG CATTGTGAAT ACAGAGCCAG GTGAGGTGGC
5451 TCATGCCTAT AATCCCAACA CTTTGAGAGG CTGAGGCAGG CGGATCACTT
5501 GAGCTCAGGA GTTCGAGACC AGCCTGGGCA ACATGACAAA ACTCCGTCTC
5551 TACCAAAAAT ACAAAAAATT AGCCAGGCAT GGTGCTCCAT GCCTGTGGTC
5601 CCAGCTGCTT GGGAGGCTGA GGTGGGAGAA TCGCCTGAGG CCGGGAGGTT
5651 GAGGCTGCAG TGAGCTGAGA GTGCCACTGC ACTCCAACTT GGGTGACAGA
5701 GTGAGGCCCT GTCTCAAAAA AAAAAAAAAA AAAGTGTGAC TGTAACTGGA
5751 GTTTGGAGGG GAGGTTATTT CCAGATTGCC CTCCATAGCA GTGGCGTATG
5801 CTGTGCTCCT GTGAGCAATG TATATGAGAG CCCGTTTTCC TACAGTCTTG
5851 CCATCAGAGT ATATTGTCAA ACTTTTGACA ATATATTTGA CAATCTGAGA
5901 GATGAGATAT GATATTCTCC TTGTAGTCTC CATTTGCATC TCTGATCGTG
5951 GGTGAAATTG AGCATCTTTC ATAGGTTTAA GGGCCTTTGT GTTTCTCTTT
6001 TCAAGAACTA TTGATGTCCT TTGCCCATTT TTCTATTGGG TTGTTGGCTT
6051 TTTTCTTCTT GACTGACCCT GAGTTTTGGA CTCTAAGATA TCCAAGATTT
6101 CACTCCTGGA GCCCAGTAAG GGACTTTTGG CAGAGAAATA CTGTGAAAAA
6151 GGTATCCTCA AGGCACCAAA GATTAAGTAT AAAACCTAAG AATCCTGATG
6201 GCCACCATCT GGAAACAAAA TAATACATTC TTCTCCAATG CCAGATGAGA
6251 TAGAGCCCAG GAGAGTAGTG TTTCCTGGGT GTGAGCCTCA GTGTCTTCTG



FIGURE 3B 



6301 CAGCCCCTTC TATCAGAGAA GGAAGCTGAG ATTATCAGGT GCTTGCAACT
6351 CACCAAAGGA ATTATCAGCA AATGCATGGT TGAGATGCAG GTGGCTGAGC
6401 CTTGTCCCTG AAACTGGACT CCCTTTCTAT TGCTCCTTCT CTGTCTTGAC
6451 AGAGCCCCAA GATGGCCTTT TACAGTTTGG AACCCTGCTT CCTCCCTTCA
6501 ATCAAGGGGG AAGGGATAAG CTAGCCAATC AGGGGCCTTC CTCCTCTCTC
6551 TTTTAGGAAC CCCCAGAGAG GAGTGGGTGG GAGGAAGCCA GGAGTTCCCC
6601 TCAAGGAGGC AACATGTTGG GGGAGAGGTG GGGCTGTCAC CCTCAAAAGC
6651 TGGCAGCTGC TCCCTCTCCC CAGCAGACAG CTTGAAGAGA CTGGGAGCTT
6701 CTCATCCCTC CCACTTCTCA CTGATCTCCA TTGGTCTTGG GGGATCGTGG
6751 GAGCATCCGT ATACACAGGT TCCAGGCTCC TGGAGATCAC TGTGTCCAGC
6801 AGAATGCAGT CTTCCCTGGC CTAAGAAACC AGTTTCCTAT GGTTTTAGGT
6851 TTGTCCTCGG CATCCTCCCG CCGCACCAAA AATTTAAACC TCAGCACAAA
6901 GAAAAGATGC CACATCATCT CCCTAGGGAA ATCCACTGCA GCATCTTCTA
6951 AGCCTTTGAG TTGGGAAGTG CTGTTCTGAA GTTGGACTTA ACTCTGCACT
7001 ACTGCCACCA AAGTCGTTTC CTTTTGATCC TTCTTGGAAG TGGAGAACTG
7051 TAGTCCTCCT TTGTGCCTGG CCCCTGCCCC ACTCAATTCA GATGCTGGGA
7101 CAGGAGACAT ACCTCCACCT TCTTCTAGTC TTTTGCCTGG GCTTTGGTGG
7151 GAGAAGACTC TGGTTTCCTT TGTCCTTGGA GGCCTCTGTC CCCCCACCTT
7201 TAGGGACCCC CTTCTTTCCA CACACTGGCT GCCTGAAACC GCTCTTGCAG
7251 CTGGCACGTT GACTAATGAA TCTGTTAAGG AAACTTCTCT TTAGTGTACT
7301 TGGCCTTTCT AGGAGTCTCT TCACCTTGAG CTGTACCCCC CAATCCCTTG
7351 AGAAGTTGCC ACAAACATTC AGGAAGTTCA TCTCCCTGGA GCTGCCCAGG
7401 GGCCCTACTC TACATCAGCC CATTATGCAT CCAGTCTGAA TCTTTTTCTG
7451 TTTCTCAACC CTGAGGGCAG AGAGAAGCAT ACAGAAGGGG CACATCAGGT
7501 AGCAGTCTAA GGGCAGTGGC AGAGGCAGGA GTTGCATTGA TCCCAGCTTG
7551 GGCCATGGAG AGCTCACCAG CCCAGGTAGT GCTATTAAGG AGCACCTGCT
7601 TTGAGCCAAC AGTGCTAGAC ACTCAGGGAG GAAGAGGGAG TATATACAAA
7651 TGAGGATGGC CTGGCTGTGG CCTTCTCAGG AGCTCACAGC AGAAGTGGGG
7701 AACTGGAGAT GGAACAGCTC TAATGAAAGT GTAATAGAAG GATTGTTAGA
7751 ACACAGGAGA TGAAGGGAGT AGTCCCCTGC TTGCAGGAAG GATGGGAAAT
7801 CAGGAAGCTT CTTGGAGGTG GTGGCACTTT AGCTGAACCT TGGAAGATAG
7851 AATTTTAACA GGTCCAACAC CCAGCTCAGA GCTGGACTCT TAGAGGTACT
7901 TAATAAATGT ACTTGTTGAA CAAAGGCCTC GATGGATGGA TGAGGGCACG
7951 ACATGGAGCA AGGCAGAGCT AAACTCCAGA TGTGCACAAG ACAGTGCAGT
8001 GGCCCTGTAG ATCAAACAAT GTGACCTGCT CCATCCTGGC TTGGGAATGG
8051 GGAGGCTACA GCTCCTCCAT TCTCCCTGGG CCTGGTCTCC TGGGGATGGT
8101 CGGGTATGGA AGGCTTCAGG TGCAGTGGCA GGTGAGAGCA CTGCCCCTCT
8151 GATGGGAGGT GTTTGGGGGC TAGGGGAGCC CTCATGGCTG CTCTGACCCT
8201 GGTACTGGCT GGGGATATTG CAGGAGTGGG CAGACACCTG CTGCAGGGGA
8251 CTCCGGAGTG TCCACGCCTA CATCCTGGTC TACGACATCT GCTGCTTTGA
8301 CAGCTTTGAG TACGTCAAGA CCATCCGCCA GCAGATCCTG GAGACGAGGT
8351 GAGAGGCTGG AACACAGTCC ATTGCCACCT CTGTGGATGC CCCAGTGCTA
8401 GCCAGTCCCT GTGAAAAGGG CACAGTATAG GGACACAGAT AGAGGTATAT
8451 GTGTTCTAAG ATTTCCACAC ATACACTCAA ACATGCATAC ATTGTGCTGT
8501 TCCCATTTCT GTCAACTCAT GTTGGGACCG TGGCTGTGGG GGTGGCTAGA
8551 GTAGTGCAGT AGTTAAGAAC TGGGACTTCT GGAACAAGAC TTCCAGGGCC
8601 ACTCAGCTGC ATGACTTGAA GCCAGTAAAC ATTTAAGCCT ATGTCCTCAT
8651 CTGTAAAATG GGGATAACAG TAGAACCCAT CTTTTAGATC AGTTGTGCTG
8701 ATCAGAGAAT ATAACACCTC CAGGGCTTAG GGCTGCGCCT GGAGCAGAAC
8751 CTACGGTGGT GGTAGTATTG GCCAGGCACA GCCTGCCCTG CTGGGAGTAC
8801 AGCGGTTGTG GGGCTGACAG AGTTCTGAGC TGCCTGCCTC GCCCCACAGG
8851 GTGATCGGAA CCTCAGAGAC GCCCATCATC ATCGTGGGCA ACAAGCGGGA
8901 CCTGCAGCGC GGACGCGTGA TCCCGCGCTG GAACGTGTCG CACCTGGTAC
8951 GCAAGACCTG GAAGTGCGGC TACGTGGAAT GCTCGGCCAA GTACAACTGG
9001 CACATCCTGC TGCTCTTCAG CGAGCTGCTC AAGAGCGTCG GCTGCGCCCG
9051 TTGCAAGCAC GTGCACGCTG CCCTGCGCTT CCAGGGCGCG CTGCGCCGCA
9101 ACCGCTGCGC CATCATGTGA CGCCTGCGCG CCCCTCGGGC TGCACCGGCA
9151 CTGGCCGAGC GGAGGGCGGG GCCGTACTGC GGGGCTGGGG CGGGGAGCGG
9201 GCGGGAAATG GAACTGTGAC GGTCCCGGCC TGAGGCCCCT GCAGCCACGC
9251 ACCTCCCGGT GAGAAGCAGA GCGCGAGAGG GAGCCCTCCG TAACTGCCCA
9301 GCCCTGCCCC TTGCCCCCGT GGCTTCCTGG GACAGCCGCC TTCAGTGCTG
9351 TATTTAGTGC AGTGCCCGGC CCGACCCGCG GGGGTGCCAC AGCCTTTTGG
9401 GATGGGGGTG AGCGTGCAAT GGAGGCTGGG GGTGGCGAGG TGCCGCCTTG
9451 GCCGGGCCCC CACGTGTCTT CTCCAGAATG TGTCTGTCTT TGCCTGGTGT
9501 CTTCCTTTCC CGTGTCCGCC CACCCCAGCG TCTGTTGGTA CTTACCTGTC 
9551 TCACCTACCC TCCAGTCCCC TCCCAGCTCC GCTCACAGGG CTCTCATTTC 
9601 GTCCATCCCC TTGTCGCAGA TCCTGGCAGC TTCTTTGTGA GGCCAGGCCT 
9651 TCTGACTGTC AGCACCACCG GCACAGGGCA GAGATGCGGG TGGCCCAAGG 
9701 ACCACGATCA AGGGGTCCGG GGGACCGAGG TCCCAGATCA GTGAGGGGAG 
9751 AAGGTTGAGC TCTCCGGCTT CCAGGGAGAC CTCCCCGCCC AGCAGCCCCC 
9801 AGAGACACAA CAACCTACCT TCCAGCCTTA ACTCGATGGT CCGTCCCTGC 
9851 CAGGTGCCCC TCACTCTTCC TGACCCCAAA GCCAGATCAC CCCCTGGGTT 
9901 AAAACTTTTT TTCTTTTTTT TTTTTGGACA GAGTGTGGAA AGGGAGCCCC 
9951 CCAAAGGATA GCTTCTTTTT CATGATGCCA GGCTCCAGTC CTTTATTCCC 
10001 TTCTGCATAC TGCAATCTGA TCTGTCAGAC TGGGGAATGT TGGGTTCTGG 
10051 GGTCTGGTCG TGGGCAGGAT GGTGCCCAGA AGGGGGTTAG GTTGTCCCAG 
10101 TGAAAATTCT GTTGCCCCGT CTCAACCCCA TCTGACTACC CCAGACTCTG 
10151 CCTGCCTCAG ATCTCAGACT ATCCTGATTA ATCTGGGGAA GAACAGAGCC 
10201 AGGGAAAGAA TGGTGGGGAC CCCTGTACTT GGGGGAGACA CACCTGCATC 
10251 TTCCTCCTGC CACAGATGGA GGCCCTCAGG ATCTGACACC CTCTTGTCCC 
10301 AACACCAGTC AGCCCTATAC CCTAACTCAC TCCACCCCAT TTTCTCCGGC 
10351 TGCCTGGCCG GGTTTCTACC TCTCGTCACC GGAGCTGATC ACTGTCAGTT 
10401 TTGTACCGAT TTAGAAATAA CAATAATAAT GAAGATTCTA GGAATGGCAT 
10451 GAGGGATTGA TGGGGGACTT GGAGGGAGGG ACAAGTGGTG CCCTGTCCCC 
10501 TGCTCCCCTG GCCAAAGAAA GCTGTCCTTG AGGCTGAGCC CTCAGCCCTG 
10551 GCCTGGTGGG GGGACAGCAA GGTCCCTTGT TATAAGAGGG GCAGAGAGGA 
10601 CAACTCCGCT TTGGCCAACC TAGCCAAGGC TGCAGCATAT AGACCAGGAA 
10651 ATCAGGTAGC CCAGACTGGT GATGGAGCAG AGTCTGGGGG AAGGGTCGTG 
10701 GGTGGGGAAT TTATCACCAA CATCCATTGT AGGGGGAATC TATGATTCTG 
10751 CTTCCCCAGC GGATTCCCAC TCTGTCCACC AAGTGGGGGG TAGCACAGCC 
10801 TCACAGCAAC CGCCCTGACC TTGGGCAGTC TAGTGTTCCT GCATTCTAGT 
10851 CCCTGCTGTG CTGCAGGACT TTGGGCAAGT GACCTGCCCT CTGTGAGCCT 
10901 CCCTCTGACA CAGAGGAGGT GGCTCCCCTT CCCCACACCT TAGAGTGGCT 
10951 GGGAGGGTAA CAAAGAGGGC CTGCCCCTTT AGTCTCCTGC ACCCCTGCCC
11001 CCTGGTTCAC CAGAGGGAGC GGATGAAGGA TGGCAGCATC TCACATGCCC 
11051 CATCACCAAC TCTGAGGCAC CTGGGGTGGG GGGGCGGAGC CCAGGCCTCT 
11101 GGCTGCTCCC CTGTGGGAGC CATTGGAATG TATCCCCTGA CAGGCCCCCT 
11151 TCCGCCTCCA CCTCAACCCA GGTCTTGGAT TTCAGGTCCC TCCACCCCCA 
11201 TTCTGAGTCT CTGTCCTTCT CCTTCCACCC GCTCCCAGGG TTTCCCACCA 
11251 CAGGGTCTGG AAGTGTGTGT GACGCCCATT GAGCTGTTAC CCGAAGTCAG 
11301 ATTAAAAATC AGGGAGTGTT TTCCCTCGTT TCTGTACCAA GGTGTTGGCT 
11351 CCATTCCTCA TGGTAGGAGG GGAGGGGTCC CCACAGGGCT TGCCTGCTGA 
11401 GCTCCGTGTG GAAGGAGGGT GAAGGTGGTG AGGTGGCCCC CAGTCCCAAA 
11451 GCCCAGGTCA ACAGGGAGAC CACCGGTGAA GAGTTTGGGA TTTATCACCT 
11501 TTCCACCTAA CCCCAAACCC TCCAGCTAAT TCCAACCATT CAGAAGGGAA 
11551 GCAGAACTTC TCCCCTGCCA CTGTCTGGAA AATTTCCATA ATGGGACTCA 
11601 ATCCCAGCTT CTCCGTCTGC GTCTCGTCCT TCCCACTCAA GGCTGAGACT
11651 TTACAGCCTC TCAGTCATAA CTTCTTGGAT GTAGATGTGT TAGGAACACT 
11701 TTCAGCCACC CGTCTTGTCC CTGAGTGATC TCAGGTCCCA AACTCCAGAG 
11751 CAAAGCTTTG AAATCTTGGG CAAGGGTGCC TTGTGGGAGC CTGTGTGTTG 
11801 AGGGCAGGAC TGGTCTCTGT CCGTGGTGCT GACCCACCAG CCACTTCCAG 
11851 GAAAGATGGG GCTGCCTGGC AAGGTTGGCT GAGCCTCAAA AGAGGAAGCC
11901 TCTCTCACCA CCAACTCCTT CCTTCTAGTC CCCATCTCCT CCAGTGGGAT 
11951 AACATCTGAA GCTATACCTC CCCGCACCAC CACAGTCCTG GAGTGAGGGA 
12001 CTCAAGAAGC TGGGGGGCAG GGGGAGGCAG GTTCAGTGGT TCACATCTTT 
12051 AATCCCACTG CTTTGGGAGG CCAAGGCAGG AGGATCGCTT GAGGCCAGCC 
12101 TGGACAACAT AGTAAGAC (SEQ ID NO:3) 

FEATURES: 

Start: 3000 

Exon: 3000-3215 

Intron: 3216-8223 

Exon: 8224-8348 

Intron: 8349-8849 

Exon: 8850-9117 

Stop: 9118 
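
The exon coordinates above can be checked for consistency with the 203-residue protein: the three exons should sum to 609 coding nucleotides, and the gaps between them should reproduce the listed introns. A small Python sketch of that bookkeeping follows; the helper names (`exon_lengths`, `introns`, `splice`) are hypothetical and only illustrate the arithmetic.

```python
# Exon coordinates from the FEATURES list (1-based, inclusive).
EXONS = [(3000, 3215), (8224, 8348), (8850, 9117)]


def exon_lengths(exons):
    """Length of each exon in nucleotides."""
    return [end - start + 1 for start, end in exons]


def introns(exons):
    """Intervals lying between consecutive exons (1-based, inclusive)."""
    return [(a_end + 1, b_start - 1)
            for (_, a_end), (b_start, _) in zip(exons, exons[1:])]


def splice(genomic, exons):
    """Concatenate the exon slices of a genomic sequence string."""
    return "".join(genomic[start - 1:end] for start, end in exons)


lengths = exon_lengths(EXONS)
total = sum(lengths)
print(lengths, total, total // 3)  # prints [216, 125, 268] 609 203
print(introns(EXONS))              # prints [(3216, 8223), (8349, 8849)]
```

Given the full genomic sequence as a string, `splice(genomic, EXONS)` would be expected to return the 609-nucleotide coding region that precedes the stop codon at 9118.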
CHROMOSOME MAP POSITION:

Chromosome 17 



ALLELIC VARIANTS (SNPs) : 
         G
         G   C   Beyond ORF (3')
11079    G   A   Beyond ORF (3')
12087    A   G   Beyond ORF (3')


Context: 



DNA 

Position 

2455 CTCCTTCTAAGCTTCTCTCTAAAGTGGGAATAACAGTACTTATCCCTTAGAGATGTCGTA 
CTGACCAAGGATAACCATATGGGCATGCCAGCAGAGTAAGTGCTCAGCAGTGTTGCACCC 
CAGAGGCGACGTGGTCATCTAGTGCAGCATTCTCAACTAGGGGCATTTGGAAATGGGGGC 
AGGGGGAGTTTTTGGTCATCATGGTGTCTGGAGGCTGGCAGTTACCTCTTGGGGGCCACG 
GAAAGCAAATGTCCTGCAATGTGTGGGCAGTCCTGCACAATCGAGTTCTCTCCCCACAAA 
[A,C] 

TGTCAGGAGTGCCAGCACTGAGACACACCTGCCCAGTCCCACCCATTCAGGAGGACACAG 
ACTCAGAGGTGTTGCCGTCTTGTCCCAGGCTCTGTGGGGAAGCTGGGATCAAACCAAGTC 
CAGTGCGCTTCCCACTCTGCTCTGCAGCCTGTTTTGGTTGGAGTTGGACCTGGAGAAAAG 
TCAAGTCATAAGTCAAGAAAGATTGGGCCCTACTACTGGAATGCAGGAAAAAATGGAGGA 
GGGATGGAGAGGTTTTGGAAAGGCAGCCACAGGGGTTCTGGGAGAGGGAAGGCATTCTAA 



2785 TGCCCAGTCCCACCCATTCAGGAGGACACAGACTCAGAGGTGTTGCCGTCTTGTCCCAGG

CTCTGTGGGGAAGCTGGGATCAAACCAAGTCCAGTGCGCTTCCCACTCTGCTCTGCAGCC 
TGTTTTGGTTGGAGTTGGACCTGGAGAAAAGTCAAGTCATAAGTCAAGAAAGATTGGGCC 
CTACTACTGGAATGCAGGAAAAAATGGAGGAGGGATGGAGAGGTTTTGGAAAGGCAGCCA 
CAGGGGTTCTGGGAGAGGGAAGGCATTCTAAGTGGCAGTAACAGCTTCAGCAAAGTCCCA 
[A,G] 

AGGTGGAAAAGTGCAGGACACGTCCAGGGATAAGCCAGTGCACTAAGCCCACCTCTTGTC 
CCCACAGTCCAGGTGGAGGCCGCAGAGGGCCCAGGGCAAGCAGAGGCAGCAATGGTTGGT 
CCTGACGGTGGCTGAGCCCCCAGCCCCTGGAATATGCAGCCCGGGGGAGCCCCAGACAGC 
GGCAAGGACGAGGTGGCGGAGTGGGGCGGGAGGCATGGTCTCCACCTACCGGGTGGCCGT 
GCTGGGGGCGCGAGGTGTGGGCAAGAGTGCCATCGTGCGCCAGTTCTTGTACAACGAGTT 

3482 ACCCATCAGCGCCTTCCCTGTCAATACGCTCCAGGTAGGAGGACCCTGGGGGGCATGGGT 
TAGTGGGGAAACGGATGGGTAGGGGAGAGGCTGGATTCCAAACTGCTGTAGCTTGGGCCC 
TATTGCCAGGGCCCCATCACTGAGTTTGGGAGCTCCACACTGCACCTTGGGCCACTCTGC 
TTAGAGCCGTTCCAGGAATCCATTCATTGGTGTGCTAGTTTATTCAACAAATATTTGGTG 
ACCGTTCAATGTGTGCCAGGCCCTGCAGTGGGCACTGGTGCAGAATGGTGAGCAAAAAAT 
[T, A] 

TATGGAATTTGCTTTCAAGAAACTCATAGTCTGGTGAGAAAAGGCAAATATGGTGTGATA 
AGTTCTATGATTGGAGGAGCAGGGAGCTGGGGCAGCCCTTAAGGGGGCATCTAGGCCATC 
CAGATGTGTTGGGGTGGAGTTGGGGGGTCACAGAGGGTGATGTCTCAACTAAATAGGTTT 
TAGGCAGGTAAGAGTCAGTAGAGAAAAGGACAGGGAACACTAGGCTACTGTGAGTATTCG 
GAGCTGTGCCTACCGTAACCTCACTCCACATCCTCTGGAGAAGGGACAGCAGCAGAACAG 



6189 GACAATCTGAGAGATGAGATATGATATTCTCCTTGTAGTCTCCATTTGCATCTCTGATCG

TGGGTGAAATTGAGCATCTTTCATAGGTTTAAGGGCCTTTGTGTTTCTCTTTTCAAGAAC 
TATTGATGTCCTTTGCCCATTTTTCTATTGGGTTGTTGGCTTTTTTCTTCTTGACTGACC 
CTGAGTTTTGGACTCTAAGATATCCAAGATTTCACTCCTGGAGCCCAGTAAGGGACTTTT 
GGCAGAGAAATACTGTGAAAAAGGTATCCTCAAGGCACCAAAGATTAAGTATAAAACCTA
[A, G] 

GAATCCTGATGGCCACCATCTGGAAACAAAATAATACATTCTTCTCCAATGCCAGATGAG 
ATAGAGCCCAGGAGAGTAGTGTTTCCTGGGTGTGAGCCTCAGTGTCTTCTGCAGCCCCTT 
CTATCAGAGAAGGAAGCTGAGATTATCAGGTGCTTGCAACTCACCAAAGGAATTATCAGC 
AAATGCATGGTTGAGATGCAGGTGGCTGAGCCTTGTCCCTGAAACTGGACTCCCTTTCTA 
TTGCTCCTTCTCTGTCTTGACAGAGCCCCAAGATGGCCTTTTACAGTTTGGAACCCTGCT 

6491 AATCCTGATGGCCACCATCTGGAAACAAAATAATACATTCTTCTCCAATGCCAGATGAGA
TAGAGCCCAGGAGAGTAGTGTTTCCTGGGTGTGAGCCTCAGTGTCTTCTGCAGCCCCTTC
TATCAGAGAAGGAAGCTGAGATTATCAGGTGCTTGCAACTCACCAAAGGAATTATCAGCA
AATGCATGGTTGAGATGCAGGTGGCTGAGCCTTGTCCCTGAAACTGGACTCCCTTTCTAT
TGCTCCTTCTCTGTCTTGACAGAGCCCCAAGATGGCCTTTTACAGTTTGGAACCCTGCTT 
[T,C] 

CTCCCTTCAATCAAGGGGGAAGGGATAAGCTAGCCAATCAGGGGCCTTCCTCCTCTCTCT 
TTTAGGAACCCCCAGAGAGGAGTGGGTGGGAGGAAGCCAGGAGTTCCCCTCAAGGAGGCA 
ACATGTTGGGGGAGAGGTGGGGCTGTCACCCTCAAAAGCTGGCAGCTGCTCCCTCTCCCC 
AGCAGACAGCTTGAAGAGACTGGGAGCTTCTCATCCCTCCCACTTCTCACTGATCTCCAT 
TGGTCTTGGGGGATCGTGGGAGCATCCGTATACACAGGTTCCAGGCTCCTGGAGATCACT 

7353 GTCCTCCTTTGTGCCTGGCCCCTGCCCCACTCAATTCAGATGCTGGGACAGGAGACATAC 
CTCCACCTTCTTCTAGTCTTTTGCCTGGGCTTTGGTGGGAGAAGACTCTGGTTTCCTTTG 
TCCTTGGAGGCCTCTGTCCCCCCACCTTTAGGGACCCCCTTCTTTCCACACACTGGCTGC 
CTGAAACCGCTCTTGCAGCTGGCACGTTGACTAATGAATCTGTTAAGGAAACTTCTCTTT 
AGTGTACTTGGCCTTTCTAGGAGTCTCTTCACCTTGAGCTGTACCCCCCAATCCCTTGAG 
[A,T] 

AGTTGCCACAAACATTCAGGAAGTTCATCTCCCTGGAGCTGCCCAGGGGCCCTACTCTAC 
ATCAGCCCATTATGCATCCAGTCTGAATCTTTTTCTGTTTCTCAACCCTGAGGGCAGAGA 
GAAGCATACAGAAGGGGCACATCAGGTAGCAGTCTAAGGGCAGTGGCAGAGGCAGGAGTT 
GCATTGATCCCAGCTTGGGCCATGGAGAGCTCACCAGCCCAGGTAGTGCTATTAAGGAGC 
ACCTGCTTTGAGCCAACAGTGCTAGACACTCAGGGAGGAAGAGGGAGTATATACAAATGA

8688 TGCCCCAGTGCTAGCCAGTCCCTGTGAAAAGGGCACAGTATAGGGACACAGATAGAGGTA 
TATGTGTTCTAAGATTTCCACACATACACTCAAACATGCATACATTGTGCTGTTCCCATT 
TCTGTCAACTCATGTTGGGACCGTGGCTGTGGGGGTGGCTAGAGTAGTGCAGTAGTTAAG 
AACTGGGACTTCTGGAACAAGACTTCCAGGGCCACTCAGCTGCATGACTTGAAGCCAGTA 
AACATTTAAGCCTATGTCCTCATCTGTAAAATGGGGATAACAGTAGAACCCATCTTTTAG 
[A,G] 

TCAGTTGTGCTGATCAGAGAATATAACACCTCCAGGGCTTAGGGCTGCGCCTGGAGCAGA 
ACCTACGGTGGTGGTAGTATTGGCCAGGCACAGCCTGCCCTGCTGGGAGTACAGCGGTTG 
TGGGGCTGACAGAGTTCTGAGCTGCCTGCCTCGCCCCACAGGGTGATCGGAACCTCAGAG
ACGCCCATCATCATCGTGGGCAACAAGCGGGACCTGCAGCGCGGACGCGTGATCCCGCGC 
TGGAACGTGTCGCACCTGGTACGCAAGACCTGGAAGTGCGGCTACGTGGAATGCTCGGCC 

10789 TGCCCTGTCCCCTGCTCCCCTGGCCAAAGAAAGCTGTCCTTGAGGCTGAGCCCTCAGCCC 
TGGCCTGGTGGGGGGACAGCAAGGTCCCTTGTTATAAGAGGGGCAGAGAGGACAACTCCG 
CTTTGGCCAACCTAGCCAAGGCTGCAGCATATAGACCAGGAAATCAGGTAGCCCAGACTG
GTGATGGAGCAGAGTCTGGGGGAAGGGTCGTGGGTGGGGAATTTATCACCAACATCCATT 
GTAGGGGGAATCTATGATTCTGCTTCCCCAGCGGATTCCCACTCTGTCCACCAAGTGGGG 
[G,C] 

GTAGCACAGCCTCACAGCAACCGCCCTGACCTTGGGCAGTCTAGTGTTCCTGCATTCTAG 
TCCCTGCTGTGCTGCAGGACTTTGGGCAAGTGACCTGCCCTCTGTGAGCCTCCCTCTGAC 
ACAGAGGAGGTGGCTCCCCTTCCCCACACCTTAGAGTGGCTGGGAGGGTAACAAAGAGGG 
CCTGCCCCTTTAGTCTCCTGCACCCCTGCCCCCTGGTTCACCAGAGGGAGCGGATGAAGG 
ATGGCAGCATCTCACATGCCCCATCACCAACTCTGAGGCACCTGGGGTGGGGGGGCGGAG 

11079 CCAAGTGGGGGGTAGCACAGCCTCACAGCAACCGCCCTGACCTTGGGCAGTCTAGTGTTC 
CTGCATTCTAGTCCCTGCTGTGCTGCAGGACTTTGGGCAAGTGACCTGCCCTCTGTGAGC 
CTCCCTCTGACACAGAGGAGGTGGCTCCCCTTCCCCACACCTTAGAGTGGCTGGGAGGGT 
AACAAAGAGGGCCTGCCCCTTTAGTCTCCTGCACCCCTGCCCCCTGGTTCACCAGAGGGA 
GCGGATGAAGGATGGCAGCATCTCACATGCCCCATCACCAACTCTGAGGCACCTGGGGTG 
[G, A] 

GGGGGCGGAGCCCAGGCCTCTGGCTGCTCCCCTGTGGGAGCCATTGGAATGTATCCCCTG 



FIGURE 3F 



ACAGGCCCCCTTCCGCCTCCACCTCAACCCAGGTCTTGGATTTCAGGTCCCTCCACCCCC 
ATTCTGAGTCTCTGTCCTTCTCCTTCCACCCGCTCCCAGGGTTTCCCACCACAGGGTCTG 
GAAGTGTGTGTGACGCCCATTGAGCTGTTACCCGAAGTCAGATTAAAAATCAGGGAGTGT 
TTTCCCTCGTTTCTGTACCAAGGTGTTGGCTCCATTCCTCATGGTAGGAGGGGAGGGGTC 

12087 GAGCCTGTGTGTTGAGGGCAGGACTGGTCTCTGTCCGTGGTGCTGACCCACCAGCCACTT
CCAGGAAAGATGGGGCTGCCTGGCAAGGTTGGCTGAGCCTCAAAAGAGGAAGCCTCTCTC
ACCACCAACTCCTTCCTTCTAGTCCCCATCTCCTCCAGTGGGATAACATCTGAAGCTATA 
CCTCCCCGCACCACCACAGTCCTGGAGTGAGGGACTCAAGAAGCTGGGGGGCAGGGGGAG 
GCAGGTTCAGTGGTTCACATCTTTAATCCCACTGCTTTGGGAGGCCAAGGCAGGAGGATC 
[A, G] 

CTTGAGGCCAGCCTGGACAACATAGTAAGAC 